Search for: All records

Creators/Authors contains: "Wang"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Free, publicly-accessible full text available December 18, 2027
  2. Free, publicly-accessible full text available May 18, 2027
  3. Modern model hubs, such as Hugging Face, store tens of petabytes of LLMs, with fine-tuned variants vastly outnumbering base models and dominating storage consumption. Existing storage reduction techniques, such as deduplication and compression, are either LLM-oblivious or incompatible with each other, limiting data reduction effectiveness. Our large-scale characterization study across all publicly available Hugging Face LLM repositories reveals several key insights: (1) fine-tuned models within the same family exhibit highly structured, sparse parameter differences suitable for delta compression; (2) bitwise similarity enables LLM family clustering; and (3) tensor-level deduplication is better aligned with model storage workloads, achieving high data reduction with low metadata overhead. Building on these insights, we design BitX, an effective, fast, lossless delta compression algorithm that compresses the XORed difference between fine-tuned and base LLMs. We build ZipLLM, a model storage reduction pipeline that unifies tensor-level deduplication and lossless BitX compression. By synergizing deduplication and compression around LLM family clustering, ZipLLM reduces model storage consumption by 54%, over 20% higher than state-of-the-art deduplication and compression approaches.
    Free, publicly-accessible full text available May 4, 2027
  4. Free, publicly-accessible full text available March 31, 2027
  5. Interactive notebook programming is universal in modern ML and AI workflows, with interactive deep learning training (IDLT) emerging as a dominant use case. To ensure responsiveness, platforms like Jupyter and Colab reserve GPUs for long-running notebook sessions, despite their sporadic GPU usage, leading to extremely low GPU utilization and prohibitively high costs. In this paper, we introduce NotebookOS, a GPU-efficient notebook platform tailored for the unique requirements of IDLT. NotebookOS employs replicated notebook kernels with Raft-synchronized replicas distributed across GPU servers. To optimize GPU utilization, NotebookOS oversubscribes server resources, leveraging high inter-arrival times in IDLT workloads, and allocates GPUs only during active cell execution. It also supports replica migration and automatic cluster scaling under high load. Altogether, this design enables interactive training with minimal delay. In evaluations on production workloads, NotebookOS saved over 1,187 GPU hours in 17.5 hours of real-world IDLT, while significantly improving interactivity.
    Free, publicly-accessible full text available March 22, 2027
  6. Free, publicly-accessible full text available March 1, 2027
  7. We study optimal pricing in a single-server queueing system that can be observable or unobservable, depending on how customers receive information to estimate sojourn time. Our primary objective is to determine whether the service provider is better off making the system observable or unobservable under optimal pricing. We formulate the optimal pricing problem using Markov decision process (MDP) models for both observable and unobservable systems. For unobservable systems, the problem is studied using an MDP with a fixed-point equation as an equilibrium constraint. We show that the MDPs for both observable and unobservable queues are special cases of a generalized arrivals-based MDP model, in which the optimal arrival rate (rather than price) is set in each state. Then, we show that the optimal policy that solves the generalized MDP exhibits a monotone structure, in that the optimal arrival rate is non-increasing in the queue length, which allows for developing efficient algorithms to determine optimal pricing policies. Next, we show that if no customers overestimate sojourn time in the observable system, it is in the interest of the service provider to make the system observable. We also show that if all customers overestimate sojourn time, the service provider is better off making the system unobservable. Lastly, numerical results indicate that when customers are heterogeneous in estimating their sojourn time, the service provider can expect a higher gain by making the system observable if, on average, customers do not significantly overestimate sojourn time.
    Free, publicly-accessible full text available March 1, 2027
  8. Free, publicly-accessible full text available March 1, 2027
  9. Free, publicly-accessible full text available February 1, 2027
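The XOR-then-compress delta idea behind BitX (item 3) can be sketched as a toy. This is an illustrative sketch only, assuming float32 tensors and using zlib as a stand-in lossless compressor; the paper's actual encoder, formats, and compressor may differ. Because a fine-tuned tensor differs only slightly from its base, the bitwise XOR is dominated by zero bits (shared sign, exponent, and high mantissa bits), which a generic lossless compressor shrinks effectively:

```python
import zlib
import numpy as np

def bitx_delta(base: np.ndarray, finetuned: np.ndarray) -> bytes:
    """XOR the raw bit patterns of two same-shape float32 tensors, then
    losslessly compress the result."""
    xor = base.view(np.uint32) ^ finetuned.view(np.uint32)
    return zlib.compress(xor.tobytes(), level=9)

def bitx_restore(base: np.ndarray, blob: bytes) -> np.ndarray:
    """Invert bitx_delta: decompress, XOR against the base, reinterpret."""
    xor = np.frombuffer(zlib.decompress(blob), dtype=np.uint32)
    return (base.view(np.uint32) ^ xor).view(np.float32)

# Synthetic "fine-tune": small perturbations of a random base tensor.
rng = np.random.default_rng(0)
base = rng.standard_normal(100_000).astype(np.float32)
finetuned = base + (rng.standard_normal(100_000) * 1e-4).astype(np.float32)

blob = bitx_delta(base, finetuned)
restored = bitx_restore(base, blob)
assert np.array_equal(restored, finetuned)   # XOR is exactly invertible
assert len(blob) < finetuned.nbytes          # delta compresses below raw size
```

Because XOR is its own inverse, the round trip is bit-exact (lossless) regardless of the compressor used for the XORed stream.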
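The utilization argument behind NotebookOS (item 5), that binding a GPU only during active cell execution needs far fewer GPUs than reserving one per session, can be illustrated on a toy trace. All session timings below are hypothetical; this is a sweep-line sketch of the allocation accounting, not the system's scheduler:

```python
# Each session runs cells only intermittently: session -> [(start, end), ...]
sessions = {
    "s1": [(0, 2), (10, 12)],
    "s2": [(1, 3), (20, 21)],
    "s3": [(5, 6), (15, 18)],
    "s4": [(7, 8), (16, 17)],
}

# Sweep-line over cell executions: +1 GPU at cell start, -1 at cell end.
events = []
for runs in sessions.values():
    for start, end in runs:
        events.append((start, +1))
        events.append((end, -1))
events.sort()  # at equal times, -1 sorts before +1, releasing before acquiring

peak = cur = 0
for _, delta in events:
    cur += delta
    peak = max(peak, cur)

# Per-session reservation needs one GPU per session for the whole trace;
# execution-time binding needs only the peak number of concurrent cells.
print(f"reserved-per-session GPUs: {len(sessions)}; on-demand peak: {peak}")
```

On this trace, reservation needs 4 GPUs while execution-time binding peaks at 2, which is the gap oversubscription exploits when inter-arrival times between cell executions are high.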